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Index generation method for database management system, involves 
presenting associative access of generated combined index of 
unstructured and structured data columns to database using query 
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Abstract (Basic) : US 20030033275 Al 

NOVELTY - One or more unstructured data columns including image 
data, video/audio data , text and, structured data columns are 
identified from a database. A combined index of unstructured" and 
structured data columns is generated, and associative access is 
presented to the database using a query including both structured and 
unstructured conditions . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(1) database management system; and 

(2) data table search method. 

USE - For generating yindex for database management system (DBMS) 
(claimed) , relational database management system (RDBMS) in computer 
system connected to network, such as Internet, intranet, wide area 
network (WAN) , local area network (LAN) , data communication network, 
etc . 

ADVANTAGE - Data retrieval for queries having combined 
structured and unstructured conditions are improved. The amount of 
data fetched to determine relevant rows is reduced, hence the memory 
used for caching is utilized better. Response time for queries is 
reduced. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
the database management system, 
pp; 11 DwgNo 1/4 

Title Terms: INDEX; GENERATE; METHOD; DATABASE; MANAGEMENT; SYSTEM; PRESENT 
; ASSOCIATE; ACCESS; GENERATE; COMBINATION; INDEX; UNSTRUCTURED; 
STRUCTURE; DATA; COLUMN; DATABASE; QUERY 

Derwent Class: T01 

International Patent Class (Main) : G06F-007/00 
File Segment: EPI 

21/5/10 (Item 10 from file: 350) 

DIALOG (R) File 350: Derwent WPIX 

(c) 2004 Thomson Derwent. All rts. reserv. 

015088655 **Image available** 

WPI Acc No: 2003-149173/200314 

XRPX Acc No: N03-117728 

Signature verifying method for e- commerce applications where contact 
contains multimedia data and unstructured data and where signatory 
can use voice to sign contract 

Patent Assignee: TELEFONAKTIEBOLAGET ERICSSON L M (TELF ) 

Inventor: BOGESTAM K; TORVINEN V M 

Number of Countries: 101 Number of Patents: 002 



Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

WO 200307133 Al 20030123 WO 2002FI618 A 20020708 200314 

EP 1407340 Al 20040414 EP 2002745460 A 20020708 200426 

WO 200-2FI618 A 20020708 



Priority Applications (No Type Date) : FI 20011498 A 20010709 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
WO 200307133 Al E 39 G06F-001/00 



Designated States 


(National ) : 


AE 


AG 


AL 


AM 


AT 


AU 


AZ 


BA 


BB 


BG 


BR 


BY 


BZ 


CA 


CH CN CO CR CU CZ 


DE DK DM DZ 


EC 


EE 


ES 


FI 


GB 


GD 


GE 


GH 


GM 


HR 


HU 


ID 


IL 


IN 


IS JP KE KG KP KR 


KZ LC LK LR 


LS 


LT 


LU 


LV 


MA 


MD 


MG 


MK 


MN 


MW 


MX 


MZ 


NO 


NZ 


OM PH PL PT RO RU 


SD SE -sxr SI 


SK 


SL 


TJ 


TM 


TN 


TR 


TT 


TZ 


UA 


UG 


US 


UZ 


VN 


YU 


ZA ZM ZW 
































Designated States 


(Regional) : 


AT 


BE 


BG 


CH 


CY 


CZ 


DE 


DK 


EA 


EE 


ES 


FI 


FR 


GB 


GH GM GR IE IT KE 


LS LU MC MW 


MZ 


NL 


OA 


PT 


SD 


SE 


SK 


SL 


SZ 


TR 


TZ 


UG 


ZM 


ZW 


1407340 Al E 


G06F-001/00 


Based on patent WO 200307133 








Designated States 


(Regional) : 


AL 


AT 


BE 


BG 


CH 


CY 


CZ 


DE 


DK 


EE 


ES 


FI 


FR 


GB 


GR IE IT LI LT LU 


LV MC MK NL 


PT 


RO 


SE 


SI 


SK 


TR 



















Abstract (Basic) : WO 2003007133 Al 

NOVELTY - Content provider (CP) presents a contract to signatory in 
that includes at least two of the following: audio, images, video or 
unstructured - text . The • signatory signs by speaking to accept contract 
which is returned to the CP. The CP sends contract with unstructured 
data and charged amount to payment provider. Payment provider stores 
the voice file and deducts the amount from the signatory ! s account. 
DETAILED DESCRIPTION - INDEPENDENT CLAIMS are included for: 

(1) A system for verifying electronic data 

(2) An electronic device for signing documents 

USE - Electronic signatures in e-commerce applications. 

ADVANTAGE - Can contain data not normally included in electronic 
contracts such as unstructured text, audio and video data. Overcomes 
problems when size of data to be signed is too large for screen. 

DESCRIPTION OF DRAWING (S) - The drawing shows a flow diagram of the 
invention 
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Abstract (Basic) : WO 200072452 A2 

NOVELTY - A protocol having sequence of structured triggers and 
responses is selected for converting unstructured information to 
structured information .^frhe structured information and associated 
unstructured information are stored in a database (132) . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(a) apparatus with computing element capable of selecting a 
protocol ; 

(b) method of doing business and providing service to customers 
using computer protocol; 

(c) apparatus for providing service to customers using computer 
protocol 

USE - For remote medical treatment, E-commerce applications, 
classification of essays or. speeches,, written material resumes, 
educational or training services, emergency services, legal services, 
medical services, opinion poll services, physical therapy services, 
police or fire services, psychological services, registration services, 
services for collecting information, services for providing 
information, services for responding to complaints or workers 
compensation services. 

ADVANTAGE - Since the unstructured information are converted to 
structured information', classification of the information depending 
on user's need is enab^eqjk J 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
system for converting unstructured information into structured 
information - 
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Abstract (Basic) : WO 200135265 A2 

NOVELTY - The method provides mobile information processing and 
groups objects into domain and provides dictionary of descriptions of 
objects in domain, and identifying descriptions in structured or 
unstructured data , and extracts identified descriptions from this 
data. The interactive device includes web browsers, computer or 
television screens, wireless cell phone screens, etc. 

DETAILED DESCRIPTION, An independent claim describes a computer 
implemented method of managing a collection of unique objects. 

USE - As a method for providing mobile information processing and 
grouping objects into a domain. 

ADVANTAGE - Maintains consumer attention so it is not squandered 
by either consumers or businesses. 

DESCRIPTION OF DRAWING (S) - The drawing shows a block diagram of 
the mobile information system. 

pp; 29 DwgNo 2/8 

Title Terms: METHOD; MOBILE; INFORMATION; PROCESS; GROUP; OBJECT; DOMAIN; 

DICTIONARY; DESCRIBE; OBJECT; DOMAIN; IDENTIFY; DESCRIBE; STRUCTURE; 

UNSTRUCTURED; DATA- . - 

Derwent Class: T01 

International Patent Class (Main) : G06F-017/00; G06F-017/30 
File Segment: EPI 



21/5/17 (Item 17 from file: 350) 

DIALOG (R) File 350: Derwent WPIX 

(c) 2004 Thomson Derwent. A^l^ rts . reserv. 

013023972 **Image available** 
WPI Acc No: 2000-195823/200017 
XRPX Acc No: N00-144863 

Unstructured supplementary service data transfer, setting method in 



cellular communication system, involves setting mobile station to call 
mode for transfer in fast ^Jiannel if mobile station is not involved in 
call ^ 

Patent Assignee: NOKIA NETWORKS OY (OYNO ) 
Inventor: TARN AN EN T; YLIHARJU S 

Number of Countries: 087 Number of Patents: 004 
Patent Family: 

Kind Date Applicat No Kind Date Week 

B 



Patent No 


Kind 


Date 


Applicat No 


Kind 


Date 


Week 


WO 


200008885 


Al 


20000217 


WO 99FI649 


A 


19990803 


200017 


FI 


9801692 


A 


20000204 


FI 981692 


A 


19980803 


200022 


AU 


9951665 


A 


20000228 


AU 9951665 


A 


19990803 


200030 


FI 


107503 


Bl 


20010815 


FI 981692 


A 


19980803 


200149 



Priority Application 
Patent Details: 
Patent No Kind Lan 
WO 200008885 Al E 
Designated States 
CR CU CZ DE DK EE 
KZ LC LK LR LS LT 
SK SL TJ TM TR TT 
Designated States 
IE IT KE LS LU MC 
FI 9801692 A 
AU 9951665 A 
FI 107503 Bl 



s (No Type Date) : ' FI' "981692 A 19980803 
Filing Notes 



Pg Main IPC 

15 H04Q-007/38 
(National) : AE AL AM AT AU AZ 
ES FI GB GD GE GH GM HR HU ID 
LU LV MD MG MK MN MW MX NO NZ 
UA UG US.UZ VN YU ZA ZW 
(Regional ): AT BE CH CY DE DK 



BA BB 
IL IN 
PL PT 



BG 
IS 
RO 



BR 
JP 
RU 



BY CA 
KE KG 
SD SE 



CH CN 
KP KR 
SG SI 



EA ES FI FR GB GH GM GR 



MW NL.t)A PT SD SE SL SZ UG ZW 
H04Q-000/00 

H04Q-007/38 Based on patent WO 200008885 
H04Q-007/38 Previous Publ . patent FI 9801692 



Abstract (Basic) : WO 200008885 Al 

NOVELTY - When amount of determined data to be transmitted exceeds 
predetermined threshold and if mobile station (MS) is not involved in 
call, the MS in directed to call mode by initiating unsuccessful call 
attempt, for switching unstructured supplementary service data 
transfer (USSD) transfer to fast channel (FCCH) . 

DETAILED ' DESCRIPTION -An INDEPENDENT CLAIM is also included for 
USSD transfer setting apparatus of mobile station. 

USE - For setting USSD transfer in cellular communication system 
e.g. global system for mobile communication. Such as cellular ratio 
telephone. Also in mobile services switching center, USSD centers, home 
location registers and visitor location registers of cellular 
communication system and service control points of intelligent 
networks . 

ADVANTAGE - By switclying communication through fast channel, based 
on data amount, the time" required for transmission is reduced. 

DESCRIPTION OF DRAWING ( S ) - The figure shows signaling diagram 
illustrating mobile-originated USSD transfer. 

Mobile station (MS) 
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Abstract (Basic) : DE 19828334 A 

The method involves storing data in structured tabular form, 
consisting of columns, i ^e^ fields, and lines, i.e. data records, so 
that each cell represents an object. Each data record of the table is 
marked with a unique master key. The data records in different tables 
can be associated with each other in various relationships, e.g. one to 
one, one to many, many to many, so that complex virtual user systems 
can be produced. 

The connection between data records results through connecting 
keys, which are laid out as external keys. Structured information 
of homogeneous objects are entered in tables which are all to be stored 
in a database file, i.e. user database. Not structured and extended 

information of these objects are stored outside the user database. 
Pointers, i.e. addresses, to the exported information are indicated in 
corresponding fields of tine structured 'tables. 

ADVANTAGE - Provides structured view of otherwise unstructured 
information . Enables faster access to required information from 
internet . 
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data 

Patent Assignee: ERICSSON INC (TELF ) 
Inventor: VALENTINE E L 

Number of Countries: 074 Nui^>er of Patents: 007 
Patent Family: 
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Abstract (Basic) : WO 9730556 A 

The binary data communication system includes a server application 
module which is associated with one node. The module includes a 
conversion function module for formatting the binary data to character 
based data. A sending function module which is connected to the 
conversion function module, transmits the formatted data to a second 
. node . ^ ...... . . . 

A client application module is associated with the second node and 
includes a reception function module for receiving the formatted 
character based data. A second conversion function module converts the 
character based data back to the binary data, representing a graphic 
image. A digital communication link is used to communicate messages 
containing the character based data between the two nodes. 

USE/ADVANTAGE - E.g. for routing binary data between first and 
second nodes within GSM or J PCS mobile communications system. Does not 
require modification ofvdurrent PCS or GSM protocols or introduction of 
new transport protocol. 
Dwg. 6/9 

Title Terms: BINARY; DATA; COMMUNICATE; SYSTEM; DATA; REPRESENT; GRAPHIC; 

IMAGE; CONVERT; BINARY; DATA; CHARACTER; BASED; DATA; TRANSMISSION; 

UNSTRUCTURED; SUPPLEMENTARY; SERVICE; DATA; MESSAGE; TRANSMIT; MOBILE; 

TERMINAL; DATA; RECONVERSION; BACK; BINARY; DATA 
Derwent Class: T01; W01; W02 
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Self correcting sync, signal^ method for communication system - generating 
cycle start signals, transmitting on interconnect, receiving and 
extracting timing data to generate local cycle sync. 

Patent Assignee: APPLE COMPUTER INC (APPY ) 

Inventor: JAMES D V; TEENER M D 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 
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Priority Applications (No Type Date) : US 90505334 A 19900405 

Abstract (Basic) : US 5052029 A 

The sync, signal provision method comprises the steps of waiting a 
predetermined period of time (nominal cycle period) and after the 
predetermined period of time, waiting a further period of time for the 
absence of communications on the interconnect among any of the units. A 
count of the passage of time is formed during this further period of 
' time (start delay) . A synchronisation' signal including the count of the 
further period of time is then formed. 

The synchronisation signal is then sent on the interconnect. The 
synchronisation signal is received and the receiving units count of 
time is updated by the predetermined period of time plus the count of 
the further period of time. 

ADVANTAGE - Does not require separate dedicated sync, channel or 
path, is readily adaptable to unstructured data and provides highly 
accurate synchronisation ^ A 7pp Dwg.No.3/3) 
Title Terms: SELF; CORRECT^SYNCHRONOUS ; SIGNAL; METHOD; COMMUNICATE; 

SYSTEM; GENERATE; CYCLE; START; SIGNAL; TRANSMIT; INTERCONNECT; RECEIVE; 
EXTRACT; TIME; DATA; GENERATE; LOCAL; CYCLE; SYNCHRONOUS 
Derwent Class: W01 
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Title: Predicting values and trends from tables and graphs 

Author: Shamo, Marcia Kuskin; Meyer, Joachim; Gopher, Daniel 
Corporate Source: Technion - Israel Inst of Technology, Haifa, Isr 
Conference Title: Proceedings of the 1996 4 0th Annual Meeting of the 

Human Factors and Ergonomics Society. Part 2 (of 2) 

Conference Location: ^^Philadelphia, PA, USA Conference Date: 

19960902-19960906 

E.I. Conference No.: 45349 

Source: Proceedings of the Human Factors and Ergonomics Society v 2 1996. 
Human Factors and Ergonomics Society, Inc., Santa Monica, CA, USA. p 
1151-1154 

Publication Year: 1996 

CODEN: PHFSDQ . ISSN: 0163-5182 

Language: English 

Document Type: CA; (Conference Article) Treatment: X; (Experimental) 
Journal Announcement: 9611W4 

Abstract: The issue of. optimal display .design was re-examined .under - the 
hypothesis that the nature of the data set would influence the efficacy of 
displays. An experiment assessed the effect of data structures on the 
relative efficacy of line graphs and tables. Participants saw a sequence of 
graphic or tabular displays which presented data from functions. The 
displays showed either values taken from the simple form of a single 
sinusoid function ( structured data conditions), or from the complex and 
seemingly unstructured form of five summed sinusoid functions ( 
unstructured data conditions). After seeing four consecutive displays 
participants were asked to <preclict the behavior of the data set in the next 
display which they had not yet seen. They were required to predict either 
the behavior of specific points, or the direction of trends. While the 
performance of the point comparison task was not influenced by display 
format, graphs were found to have an advantage for the prediction of future 
trends. Graphs also led more frequently to the identification of structure 
than did tables, both for structured and unstructured conditions. (Author 
abstract) 11 Refs. 

Descriptors: Human engineering; Display devices; Graph theory; 
Forecasting; Sensory perception; Data structures 

Identifiers: Tabular displays; Sinusoid function; Optimal display design 

Classification . Codes : 

461.4 (Human Engineering); 722.2 (Computer Peripheral Equipment); 921.4 
(Combinatorial Mathematics, Includes Graph Theory, Set Theory); 723.2 
(Data Processing) 

461 (Biotechnology); 722 (Computer Hardware) ; 921 (Applied 
Mathematics); 723 (Computer Software) 

46 (BIOENGINEERING) ; 72 (COMPUTERS & DATA PROCESSING); 92 (ENGINEERING 
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Title: Choosing a problem. When/\.s artificial intelligence appropriate 
for the retail industry. 

Author: Murdoch, Helen 

Corporate Source: Aberystwyth Uffiv, Waives 
Source: Expert Systems v 7 n 1 Feb 1990 p 42-49 



Publication Year: 1990 

CODEN: EXSYEX ISSN: 0266-4720 

Language: English 

Document Type: JA; (Journal Article) Treatment: A; (Applications); E; 
(Economic/Cost Data/Market Survey); M; (Management Aspects) 
Journal Announcement: 9101 \ 

Abstract: Current methodologies\ f or choosing problems suitable for 
implementation' as IKBS's'are "not \ *appro*priate for the retail industry, due 
to singularities of the domain. InXparticular , J^ney exclude from 
consideration a kind of system which uses Electronic Point of Sale (EPOS) 
data to structure unstructured n^anagerAal perceptions, but several 
examples of these systems have been sikce/s fully implemented. Methodologies 
may be more domain specific than they w^(e thought to be, and may hold back 
implementations to an artificially imposec^level of difficulty. 
Nonetheless, analysis of the level of/struchnre found in a problem is 
useful, as it points out possible trouble spoNts, and Fuzzy/Complex analysis 
was felt to be the most suitable methodology for retail due to its 
manufacturing origins, clarity of/purpose, and\f lexibility . (Author 
abstract) 19 Refs. / \ 

Descriptors: * ARTIFICIAL INT^LIGENCE— *Applica\ions ; ECONOMICS; 
MARKETING; MANAGEMENT; COMPUTER AIDED ANALYSIS; DA^BASE SYSTEMS 

Identifiers: RETAIL INDUSTRY; ELECTRONIC POINT OF SALE (EPOS); MARKET 
RESEARCH; FUZZY/ COMPLEX ANAI/YSIS 

Classification Codes: / 

723 (Computer Software)/; 911 (Industrial Economics); 912 (Industrial 
Engineering & Managements 

72 (COMPUTERS & DATA PROCESSING); 91 (ENGINEERING MANAGEMENT) 
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Title: SOME USEFUL DATA STRUCTURES FOR THE GENERATION OF UNSTRUCTURED 
GRIDS . ^ j 

Author: Lohner, Rainald 

Corporate Source: Berkeley Research Associates, Springfield, VA, USA 

Source: Commun Appl Numer Methods v 4 n 1 Jan- Feb 1988 p 123-135 

Publication Year: 1988 

CODEN: CANMER ISSN: 0748-8025 

Language: ENGLISH 

Document Type: JA; (Journal Article) Treatment: A; (Applications); T; 
(Theoretical) 

Journal Announcement: 8804 

Abstract: Several data structures for the generation of unstructured 
grids are described. Their usefulness stems from the fact that they enable 
the necessary search operations to be performed in an optimal way. In" 
particular, we describe heap lists, quad- and octrees, and linked lists 
. Combining these data structures , the important problem of 
interpolating information between unstructured grids is also solved. 
(Author abstract) 20 refs. 

Descriptors: DATA PROCESSING—* Data Structures ; COMPUTER PROGRAMMING 
--Algorithms; COMPUTER SIMULATION; MATHEMATICAL TECHNIQUES- -Finite Element 
Method; AERODYNAMICS— Wings and Airfoils; FLOW OF FLUIDS— Bubble Formation 

Identifiers: FRONTAL GRID GENERATION SCHEMES; UNSTRUCTURED GRID 
GENERATION; HEAP LISTS; QUfuOTREES ; OCTREES 

Classification Codes: 

723 (Computer Software); 631 (Fluid Flow & Hydrodynamics); 921 
(Applied Mathematics) ; 651 (Aerodynamics) 
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Title: INTEGRATED SYSTEMS FOR TRANSACTIONAL PROCESSING AND DECISION 
SUPPORT . 

Author: Lee, Daniel T. 

Corporate Source: Pan American Univ, USA 

Source: International* Journal *on' Policy and Information v 9 n 2 Dec 15 
1985 p 1-13 

Publication Year: 1985 
CODEN: IJPIDH 
Language: ENGLISH 

Document Type: JA; (Journal Article) Treatment: T; (Theoretical) 
Journal Announcement: 8607 

Abstract: Since technological advancement has recently reached to an 
unprecedented level, individual technologies in producing information for 
decision making are in existence. The most urgent concern is to have all 
these individual technologies integrated into a unified whole. With such 
integrated systems, comprehensive information can be produced not only 
for transactional processing, but also for decision-making. Modern decision 
makers may need a wide spectrum of information in their daily work such as 
structured data , unstructured data , audio, image, and text data. Now 
the problem is how to integrate all the information producing constructs 
into a unified whole. The purpose of this paper is to pursue an intensive 
investigation on the issue of integration systems. The main emphasis is on 
the technologies which can be used for system integration and how they can 
be integrated for enhancing the productivities of professional, clerical 
people/ and executives. -Since <the payoff is i much higher for executives, 
special emphasis is placed on the integration methodologies for producing 
information for decision making. (Edited author abstract) 34 refs. 

Descriptors: * MANAGEMENT — ^Information Systems; DECISION THEORY AND 
ANALYSIS — Computer Applications; COMPUTER NETWORKS; SYSTEMS ANALYSIS 

Identifiers: INTEGRATED SYSTEMS; TRANSACTIONAL PROCESSING; DECISION 
SUPPORT 
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912 (Industrial Engineering^ Management); 723 (Computer Software) 
91 (ENGINEERING MANAGEMENT) ; 72 (COMPUTERS & DATA PROCESSING) 



24/5/9 (Item 9 from file: 8) 

DIALOG (R) File 8 : Ei Compendex(R) 

(c) 2004 Elsevier Eng. Info. Inc. All rts . reserv. 

00535715 E.I. Monthly No: EI7605031784 E.I. Yearly No: EI76036877 

Title: INFORMATION SYSTEMS FOR POLICY MAKERS IN THE HEALTH CARE SYSTEM. 

Author: Gustafson, David H.; Thesen, Arne 
Corporate Source: Univ of Wis, Madison 

Source : Management - Da tamatics* v 5* n 1 -Feb '1976 p 31-43 

Publication Year: 1976 

CODEN: MADADG 

Language: ENGLISH 

Journal Announcement: 7605 

Abstract: It is argued that while traditional transaction or status 
reporting information systems serve many useful and necessary functions, 
they do not fully meet the needs of policy makers. Empirical evidence 
gathered from a delphi study io| health manpower professionals supports this 
argument. The most important 'questions these policy makers wanted answers 
to before setting policies were those of the " what if " kind, requiring 
projections of the impact of actions into the future. It is concluded that 
a new approach to policy information systems must be taken. Successful 
systems should not only access structured data bases of the status or 
transaction reporting kind; they should also access formal predictive 
models (based on such data) as well as unstructured data available in 
the forms of expert opinions, hunches, value judgments, etc. In such an 
information system the role of the systems analyst is expanded to not 
only include systems design and implementation but also to include the 
design and implementation of access methods for unstructured information 
. 23 refs. 
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A novel method and device for creation and management of data and 
information in electronic networks 

Author: Sharma, San jay 
Degree: Ph.D. 
Year: 2001 

Corporate Source/Institution: North Carolina State University (0155) 
Chair: Samuel C. Winchester 

Source: VOLUME 62/06-B OF DISSERTATION ABSTRACTS INTERNATIONAL. 

PAGE 2799. 143 PAGES 
Descriptors: COMPUTER SCIENCE 

Descriptor Codes: .0984 , . 

ISBN: 0-493-30112-7 

This research is an effort to understand the special needs of business 
managers who are increasingly expected to use computers and information 
technology in their business strategy and execution, but have little or no 
background in computers. The work is especially geared to understanding how 
the needs of such business managers are converted into a Requirements 
Specification Document by the ' Requirements Engineers, from which the 
information technology syst^rVs' 7 and solutions are designed. 

It is generally hard -to determine the real needs of the business 
managers due to a myriad of issues, the most important of which is the 
presence of ambiguity, uncertainty, and the ability of managers to 
routinely harbor contradictions in their decision-making process till they 
have to be resolved either way. Further, the perspective of the manager, 
and their assimilation of data and information depends upon whether they 
are decomposing a system to understand it (top-down) or building the system 
up from its pieces (bottom-up) . As the decision making process is itself 
complex, the tools that are desirable for the managers are those that are 
simple ( not simplistic) and ideally they should be almost transparent to 
the managers. . ............ , 

We have developed a machine that can help the managers deal with 
structured and unstructured data and add structure to unstructured 

data by an iterative process, and the machine is called the 
Sharma-Winchester (S-W) Machine. To validate the machine, we developed 
various versions of the S-W Computer based on the Machine, but due to the 
conf oundedness of the data, tool, and users, could not draw quantitative 
conclusions about the effectiveness of the Machine. The S-W Machine is 
based on the Graphical User Interface in preference to the Character User 
Interfaces, uses standard fodders and files (esp. archives like zip files 
that can save the path information about the files), and is based on a 
hierarchical storage design. We have shown how to implement the core 
elements of the machine, i.e., the input, output, and storage, but were 
unable to quantitatively test it because it was impossible to isolate just 
the design of the machine from its implementation as a S-W Computer during 
the case studies in which we tried to validate the device. But we believe 
that the machine is based on sound principles, and would be an effective 
contribution to the tools that are available for the managers to enhance 
their data and information related performance. 
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A PARAMETRIC ANALYSIS OF STRUCTURED AND UNSTRUCTURED Q-SORT DATA 
Author: HEAVLIN, WILLIAM D. 
Degree: PH.D. 
Year: 1980 

Corporate Source/Institution: STANFORD UNIVERSITY (0212) 
Source: VOLUME 41/08-B OF DISSERTATION ABSTRACTS INTERNATIONAL. 

PAGE 3092. 75 PAGES 
Descriptors : STATISTICS 
Descriptor Codes: 0463 

The operation of sorting the items of a Q-set according to their 
similarity to a given object is idealized by a system of axioms. As a 
consequence of this axiom system, a stochastic model of Q-sorting behavior 
is derived. This model, with its associated axioms, resembles in some 
respects the preference model of Luce; the two are compared at length. 

This Q-sort model is easily embedded in an exponential family. The 
domain of this family is the discrete space of all permutations of the 
items in the Q-set while its vnatural parameter space consists of the 
coefficients of a predictive linear relation. To operationalize the model, 
not only must predictor variables be specified, so almost must a design 
matrix describing the underlying structure of the Q-set be furnished. Such 
a construction occurs naturally for structured Q-sort data , but must be 
derived empirically for unstructured Q-sort data . Some empirical 
methods for deriving a description of the underlying structure of the Q-set 
are developed in the context of a "factor analysis" model. 

The normalizing constant of this exponential family is impossible 
to compute; the calculation of a useful surrogate for this constant is the 
primary technical problem of this work. The solution posed considers a 
conditional likelihood whose normalizing constant is calculated over an 
intuitively appealing subgroup' of permutations. A computer implementation 
of an algorithm for maximizing such a conditional likelihood model is 
described. 
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On Z39.50 wrapping and description logics. 
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Corporate Source: University of Toronto, ON, Canada M5S 3G4 ; Institute of 
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International Journal on Digital Libraries vol. 3, no. 3, pages 208-220 

Publication Date: October 2000 

ISSN: 1432-5012ISSN (electronic) : 1432-1300 

Journal 'URL: http : / /link*, springer . de/lin'k/ service/ journals/ 00799/ index . htm 

Publisher URL: http://www.springer-ny.com 

Language: English 

Document Type: Journal Article 

Record Type: Abstract 

Journal Announcement: 3707 

Z39.50 is a client/server protocol widely used in digital libraries and 
museums for searching and retrieving information spread over a number of 
heterogeneous sources. To p>ercome semantic and schematic discrepancies 
among various data sources, the protocol relies on a world view of 
information as a flat list of fields, called Access Points (AP) . One of the 
major issues for building Z39.50 wrappers mapping this unstructured list 
of APs to the underlying source data structure and semantics. For 



highly structured sources (e.g., database management systems, knowledge 
base systems), this mapping^s 7 quite complex and considerably affects the 
quality of the retrieved data. Unfortunately, existing Z39.50 wrappers have 
been developed from scratch and do not provide high-level mapping 
languages with verifiable properties. Proposes a description logic 
(DL) -based toolkit for the declarative specification of Z39.50 wrappers, 
claiming that the conceptualization of AP mappings enables a formal 
validation of the query translation quality (e.g., ill-defined mappings, 
inappropriate APs, etc.) and allows one to tackle a number of Z39.50 
pending issues (e.g., metadata retrieval, query failures due to unsupported 
APs, etc.) Further, the DL-based approach allows the development of Z39.50 
wrappers enriched with a number of added-value services such as conceptual 
structuring of . flat Z39 ..50 . voqabulaxies and .intelligent Z39.50 query . 
assists, services that are quite useful for profile developers, Z39.50 
wrappers administrators, -and end-users., . _ .. . „ . ......... 

Descriptors: Standards; Client server systems; Access points; Query 
refinement 

Classification Codes and Description: 2.4 (Standards and protocols) 
Main Heading: Knowledge Organization 
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Order out of chaos : trends in search . 

Author(s): Woods, Eric; Pelz-Sharpe, Alan; Vaughan, Dean 
Imaging & Document Solutions vol. 10, no. 3, pages 22-24 
Publication Date: March 1, 2001 
Language : English 

Document. Type :. Journal Article, .... ...... . . 

Record Type: Abstract 
Journal Announcement: 37 04 

Discusses trends in search and information retrieval technologies. 
Explains that the increasing importance of mobile access' to corporate data 
and applications will raise new issues for users and new opportunities for 
suppliers. Mentions that going beyond simple keyword and Boolean searches, 
most information retrieval products offer the following advanced tools and 
approaches: elementary lingujhstic analysis; thesaurus capabilities; pattern 

matching tools; semantic' analysis tools; document categorization or 
clustering; personalization; and user profiling. Says that categorization 
tools are designed to organize large bodies of unstructured information 
into a form that is easy to navigate. Points out that in theory, 
supporting information retrieval over mobile devices is not a major 
challenge, in light of the fact that the difficult part of retrieval has 
always been the indexing. 

Descriptors: Information resources management; information retrieval; 

Mobile communications 
Classification . Codes . and Description : . 5 .X (Information and knowledge . 

management); 7.8 (Telecommunications) 
Main Heading: The Information Industry; Information Technologies 
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Three waves of information' portals for knowledge management. 

Author (s): Morris, Henry 
KM World vol. 9, no. 6, pages 8-9 
Publication Date: July 1, 2000 
Document Type: Journal Article 



Record Type: Abstract J 
Journal Announcement: 37 02- 



Focuses on the need for supporting infrastructure in unified access to 
information, noting that simply providing access under a single portal does 

not solve some of the key problems, such as structure and format. 
Mentions several partial solutions, but points out that they are still 
flawed. Suggests that though portals are still not meeting the needs of 
today's customers, they are expected to be the solution of the future. 
Explains that the evolution of portals can lead to integrated access to 
heterogenous types of data. Overviews three "waves" of the portal 
revolution: user interf ace^led integration; separate-but- equal, access to 
structured and unstructured data ; and unified structured and 
unstructured data access. Claims the future should include tracking how 
data is used, which will improve collaboration and provide means . for a 
smart push of information to users. 

Descriptors: Information resources management; Knowledge; Organizations; 
Gateways 

Classification Codes and Description: 5.1 (Information and knowledge 

management) J 
Main Heading: The Information Industry 
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Integrated data bases for municipal decision -making. 

Book Title: In American Federation Of Information Processing Societies. 
Afips Conference Proceedings . ^Volume 44. 1975 National Computer Conference, 
May 19-22, 1975, Anaheim, California. P." 487-493. 1 Illus. 19 Ref. See Isa 
76-009/y. 

Author(s): Carlson, Eric D 

Corporate Source: Ibm Research Laboratory, San Jose, California; Mantey, 

Patrick E. 
Publication Date: 1975 
Language: English 
Document Type: Book Chapter 
Record Type: Abstract J 
Journal Announcement: 1100- 

The development of information for decision-making in municipalities 
requires integration of data from the various operational files which are 
generated in local government. Even when an integrated municipal data base 
does not exist, it is possible to develop integrated data from properly 

structured source files in conjunction with a well-maintained reference 
file, such as a geographic base file. The current sources of information 
developed in municipalities, in particular the property data of the tax 
assessor function and the operating files of various service delivery 
functions, provide a rich source of information, augmented by special ^ 
collections such as the us census. Data extraction is the process of 
developing integrated data subsets from diverse source files to support 
interactive problem solving. Extraction provides the interface between 
large data bases of source files and problem solving systems through data 
matching , subsetting, and aggregation functions. Experience with gads 
(geo-daa analysis and display system) has shown that data extraction is 
useful when the user or problem characteristics require access to varying 
amounts, detail, and selection of data, and conversational (rapid to 
response) interaction with ^yproblem solving system. These characteristics 
are likely to be encountered when designing problem solving systems for 
nonprogrammer , professional users working on unstructured problems. The 
data extraction interface matches the functional and response time 
requirements of interactive problem solving, can be implemented on a 
variety of computer system configurations, and can reduce the operating 
costs of the problem solving system. Because data extraction operations can 



produce multiple extracted ^d^ta bases, with different structures , a 
single data extraction interface can support multiple problem solving 
systems. In addition, existing problem solving systems can be supported and 
enhanced by data extraction without major program revisions. 

Classification Codes and Description: 5.01 (File Design, Building, and 
Updating) 

Main Heading: Information Processing and Control 
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The locus of the retention differences associated with degree of 
hiearchical conceptual structure. 

Book Title: Contract N00014-67-a-0356-0010 . 1973 September. Northwestern 
University, Evanston, Illinois. 43 P. Ntis: Ad-768 019/2ga; He $3.00, Mf 
$1.45. 

Author (s): Shaughnessy, Johni)^ Underwood, Benton J; Zimmerman, Joel 

Publication Date: 1973 

Language: English 

Document Type: Book Chapter 

Record Type: Abstract 

Journal Announcement: 0900 

The intent of the study was to indentify more precisely the 
characteristics of the memory for structured and unstructured lists . 

By so doing, it was believed that the characteristic or characteristics 
responsible for the differences in retention might be isolated. Two 
different levels of learning were used as a means of varying the degree to 
which the conceptual structure had become a part of the memory for the 
lists. Since the utilization of a conceptual structure to mediate item 
placement at the time of retention may be critically time dependent, three 
diferent types of retention tests were used, namely, paced and unpaced 
recall, and unpaced associative matching 

Classification Codes and Description: 2.00 (General Aspects) 
Main Heading: Research Methods 
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File organization for a large chemical information system. 

Book Title: Nbs Technical Note 285. 1966 April 18. National Bureau Of 
Standards. Superintendent Of Documents, Govt. Print. Office, Washington, 
D.c. $.25. 

. Author (,s ) : Anderson, . Ruth; Ma rden, Ethel 
Corporate Source: Institute For Applied Technology, National Bureau Of 

Standards 
Publication Date: 1966 
Language: English 
Document Type: Book Chapter 
Record Type: Abstract 
Journal Announcement: 0100 

The report describes a fi^l^ ^ structure which combines list -processing 
concepts (for handling variable length information records) with standard 
serial record arrangements (for identification information ). The file 
organization was designed for a large chemical information system and 
includes both well-structured and unstructured (amorphous) information . 
An investigation was made of representative data inputs from the department 
of the army. The data to be put into the file, the nature of the file 



structure, and the necessary programs for manipulation of file information 
have been considered as interdependent parts of a total system. Computer 
programs have been initiated to test the validity of the proposed approach. 

Classification Codes and Description: 5.01 (File Design, Building, and 
Updating) 

Main Heading: Information Processing and Control 
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Title: On Z39. 50 wrapping and. description logics 

Author(s): Velegrakis, Y. ; Christophides , V.; Constantopoulos , P. 
Author Affiliation: Dept. of Comput . Sci . , Toronto Univ., Ont . , Canada 
Journal: International Journal on Digital Libraries vol.3, no . 3 p. 

208-20 

Publisher: Springer-Verlag, 

Publication Date: Oct. 2000 Country of Publication: Germany 

CODEN: IJDIFR ISSN: 1432-5012 

SICI : 1432-5012 ( 2 00010 ) 3 : 3L . 208 : WDL; 1-G 

Material Identity Number: H423-2002-001 

U.S. Copyright Clearance Center Code: 1432-5012/ 00/$2 . 00+0 . 20 
Language: English Document Type: Journal Paper (JP) 
Treatment: Bibliography (B) ; Practical (P) 

Abstract: Z39.50 is a client/server protocol that is widely used in 
digital libraries and museums for searching and retrieving information 
spread • -over a -number - of heterogeneous- sources. To overcome semantic and 
schematic discrepancies among the various data sources, the protocol relies 
on a world view of information as a flat list of fields, called access 
points (AP) . One of the major issues for building Z39.50 wrappers is to map 
this unstructured list of APs to the underlying source data 

structure and semantics. For highly structured sources (e.g. database 
management systems, knowledge base systems) this mapping is quite complex 
and considerably affects the quality of the retrieved data. Unfortunately, 
existing Z39.50 wrappers have been developed from scratch and they do not 
provide high-level mapping ^ languages with verifiable properties. In this 
paper, we propose a description logic (DL) based toolkit for the 
declarative specification of Z39.50 wrappers. We claim that the 
conceptualization of AP mappings enables a formal validation of the query 
translation quality (e.g., ill-defined mappings, inappropriate APs, etc.) 
and allows one to tackle a number of Z39.50 pending issues (e.g. meta-data 
retrieval, query failures due to unsupported APs, etc.). Furthermore, our 
DL-based approach all ows the development of Z39.50 wrappers enriched with a 
number of added-value services such as conceptual structuring of flat 
Z39.50 vocabularies and intelligent Z39.50 query assistants. These services 
are quite useful for profile developers, Z39.50 wrapper administrators and 
end-users. (54 Refs) . - . ... 

Subfile: C 

Descriptors: access protocols; client-server systems; computer aided 
software engineering; data structures ; digital libraries; formal logic; 
formal specification; query processing; vocabulary 

Identifiers: Z39.50 wrappers; description logics; client/server protocol; 
digital libraries; museums; information retrieval; heterogeneous 
information sources; semantic discrepancies; schematic discrepancies; data 
sources; flat field list; access points; source data structure ; highly 
structured sources; database ^management systems; knowledge base systems; 
retrieved data quality; high-level mapping languages; verifiable properties 
; declarative specification; mapping conceptualization; formal validation; 
query translation quality; ill-defined mappings; meta-data retrieval; query 
failures; added-value services; conceptual structuring; vocabularies; 
intelligent query assistants; profile development; query rewriting 

Class Codes: C5640 (Protocols); C4210 (Formal logic); C6150N ( 
Distributed systems software); C6120 (File organisation); C6110F (Formal 
methods); C7250 (Information storage and retrieval); C7240 (Information 



analysis and indexing); C6115 (Programming support) 
Copyright 2002, IEE 
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Author Affiliation: Dept. of Inf. Technol., Zurich Univ., Switzerland 
Conference Title: Proceedings of the Ninth International Conference on 
Information and Knowledge Management. CIKM 2000 p. 453-60 
Editor(s): Agah, A.; Callan, J.; Rundensteiner, E. 
Publisher: ACM, New York, NY, USA 

Publication Date: 2000 Country of Publication: USA xvi+532 pp. 
ISBN: 1 58113 320 0 Material Identity Number: XX-2000-0314 6 

U.S. Copyright Clearance ^C^nter Code: 1 58113 320 0/2000/0011 ... $5 . 00 
Conference Title: Proceedings of Ninth International Conference on 
Information and Knowledge Management (CIKM) 
Conference Sponsor: ACM 

Conference Date: 6-11 Nov. 2000 Conference Location: McLean, VA, USA 
Language: English Document Type: Conference Paper (PA) 
Treatment: Practical (P) 

Abstract: Data available online today is spread over heterogeneous data 
sources like traditional databases or sources of various forms containing 

unstructured and semistructured data . Obviously, the "technical" 
availability alone is not at all sufficient for making meaningful use of 
the existing information, ^and thus _ the problem of effectively and 
efficiently accessing and querying heterogeneous data is an important 
research issue. One main approach is to integrate the data sources and 
offer users an a priori defined global schema. Alternatively there are 
approaches which implement tools for giving users the possibility to define 
the global schema themselves. We propose a new approach where heterogeneous 
sources can be queried through a unified interface and underlying sources 
are integrated by means of a query language only. We present extensions to 
OQL which allow to query structurally heterogeneous data , i.e. 

structured , semistruct^u^ed and unstructured data alike, and to 

integrate data on-the-fl-y. We also deal with issues of the extensive meta 
information about sources that is needed for integration. (17 Refs) 
Subfile: C 

Descriptors: distributed databases; object-oriented databases; query 
languages; query processing 

Identifiers: heterogeneous data source integration; heterogeneous 
databases; query processing; global schema; unified interface; query 
language; OQL; meta information; object oriented database 

Class Codes: C6160B ( Distributed databases ) ; C6160J (Object-oriented 
databases); C6140D (High level languages) 
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Title: Enabling the enterprise portal 

Author (s): Gonzales, M.L.^yx 7 

Journal: DB2 Magazine - vol.5, no . 1 p. 24-6, 28-9 
Publisher: Miller Freeman, 

Publication Date: Spring 2000 Country of Publication: USA 
CODEN : DBMAF5 

Material Identity Number: G132-2000-001 

Language: English Document Type: Journal Paper (JP) 

Treatment: Practical (P) 

Abstract: For IBM, the solution to enabling Web-based portal systems is 



not in front end technology, but rather in the underlying work of 
middleware technology. As a result of coordinated efforts between Lotus and 
IBM, Enterprise Information Portal (EIP) is able to support advanced 
textual analysis, document management and collaboration, and expert 
profiling. But the most profound feature of EIP is the API for building, 
customizing, and maintaining data access connectors. The API library and 
related toolset, including fprebuilt connectors to a number of common data 
stores, allows user organisations and third-party vendors to rapidly deploy 
complex portal applications that traverse multiple, disparate data stores 
of structured and unstructured data . EIP has a multi-tier 

architecture. It includes direct connectivity through a browser or Java 
client using Java, C++, ActiveX, or dynamic server pages. And it uses DB2 
Universal Database as the portal database. (0 Refs) 
Subfile: C 

Descriptors: business data processing; client-server systems; data mining. 
; relational databases 

Identifiers: Web-based portal systems; middleware technology; Lotus; IBM 
Enterprise Information P.ortal ,\ ...textual analysis; document management;, 
document collaboration; expert profiling; data access connector; API 
library; toolset; prebuilt connectors; user organizations; third-party 
vendors; multi-tier architecture; Java client; browser; C++; ActiveX; 
dynamic server pages; DB2 Universal Database 

Class Codes: C7100 (Business and administration); C6160D (Relational 
databases) ; C6150N (Distributed systems software) 
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Title: Knowledge discovery in digital libraries 

Author(s): Vishwanathan, T.; Zarrintaj Bornaee; Gupta, R.G. 
Author Affiliation: INSDOC, New Delhi, India 

Conference Title: Towards the New Information Society of Tomorrow: 
Innovations, Challenges and Impact. Papers presented at the 49th FID 
Conference and. Congress . . p.IIIn2-.ol . . 

Editor (s): Malwad, N.M.; Asundi, A.Y.; Ravichandra Rao, I.K.; Rajashekar, 
T.B.; Sridhar, M.S.; Sreenivasa Rao, K. 

Publisher: Indian Nat. Sci . Documentation Centre, New Delhi, India 

Publication Date: 1998 Country of Publication: India xvi+518 pp. 

Material Identity Number: XX-1998-03585 

Conference Title: Proceedings of 49th Conference of the International 
Federation for Information and Documentation 

Conference Date: 11-17 Oct. '1998 Conference Location: New Delhi, India 
Language: English Document Type: Conference Paper (PA) 
Treatment: Practical (P)' 

Abstract: Libraries world-wide are undergoing a transformation process 
from traditional to automated to electronic libraries with CD-ROM 
resources, LAN with CD-servers and access to the Internet. Electronic 
libraries that have been operational for some time are considering a move 
towards digital libraries where the entire resources of a library will be 
digitised and made accessible on networks. Digital libraries would have 
very large data sets in digital form and would need to apply data 
warehousing concepts for organised storage of their data. In essence, 
digital libraries are large electronic knowledge houses. Knowledge 
discovery in digital . libraries , S KDDX) . then becomes an important .issue. Data 
mining is that branch of knowledge that deals with discovery of hidden 
knowledge, unexpected patterns, and new rules from large electronic 
databases. A natural approach to knowledge discovery in digital libraries 
is to apply the existing data mining techniques. But the present data 
mining effort is largely directed towards applications related to financial 
data, large customer bases, help desk service records, etc. These 
applications are characterised by large-scale structured and related 
data - In contrast, digital library knowledge houses contain enormous 
amounts of unstructured J data in a variety of subject areas. As a 



result, the present data mining techniques are not ideally suited for 
knowledge discovery in digital libraries. This paper, after discussing the 
library transformation process and the non-suitability of present data 
mining techniques for knowledge discovery in digital libraries, proposes a 
new class of techniques for ■ KDDL based on knowledge organisation concepts 
widely applied in libraries^X 123 Refs) 
Subfile: C 

Descriptors: data mining; data warehouses; digital libraries; library 
automation 

Identifiers: knowledge discovery; digital libraries; library automation; 
CD-ROM; LAN; CD-servers; Internet access; very large data sets; data 
warehousing; organised storage; data mining; knowledge organisation 
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Author Affiliation: Dept ^ J of Comput . Sci., Brigham Young Univ., Provo, 
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Journal: SIGMOD Record Conference Title: SIGMOD Rec. (USA) vol.28, 
no. 2 p. 467-78 
Publisher: ACM, 

Publication Date: June 1999 Country of Publication: USA 

CODEN: SRECD8 ISSN: 0163-5808 
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Material Identity Number: A660-1999-002 

U.S. Copyright Clearance Center Code: 0163-5808/ 99/$ 05 ... $5 . 00 
Conference Title: 1999 ACM SIGMOD International Conference on Management 
of Data 

Conference Date: 1-3 June 1999 Conference Location: Philadelphia, PA, 

USA 

Language: English Document Type: Conference Paper (PA); Journal Paper 
(JP) 

Treatment: Practical (P) 

Abstract: Extraction of information from unstructured or 

semistructured Web documents often requires a recognition and delimitation 
of records. (By "record" we mean a group of information relevant to some 
entity.) Without first qhi^iking documents that contain multiple records 
according, to record boundaries, extraction of record information will not 
likely succeed. We describe a heuristic approach to discovering record 
boundaries in Web documents. In our approach, we capture the structure of a 
document as a tree of nested HTML tags, locate the subtree containing the 
records of interest, identify candidate separator tags within the subtree 
using five independent heuristics, and select a consensus separator tag 
based on a combined heuristic. Our approach is fast (runs linearly for 
practical cases within the context of the larger data extraction problem) 
and accurate (100% in the experiments we conducted). (15 Refs) 
Subfile: C 

Descriptors : , document handling; heuristic programming; hypermedia markup 
languages; information resources; Internet; tree data structures ; trees 
(mathematics 

Identifiers: record-boundary discovery; multiple records; information 
extraction; semistructured Web documents; record boundaries; record 
information; heuristic approach; nested HTML tags; subtree; candidate 
separator tags; consensus separator tag; combined heuristic; data 
extraction problem 

Class Codes: C7210N (Information networks); C6130D (Document processing 
techniques); C6120 (File qr^ahisation) ; C1160 (Combinatorial mathematics) 
; C6130M (Multimedia); C6L40D (High level languages) 
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Title: Construction of global dynamic dictionary in heterogeneous data 
integration system 

Author (s): Wang Ning; Xu Hong-Bing; Wang Neng-Bin 

Author Affiliation: Dept. of Comput. Sci . & Eng., Southeast Univ., 
Nanjing, China 

Journal: Chinese Journal of Computers vol.22, no.l p. 103-7 
Publisher: Science Press, 

Publication Date: Jan. 1999 Country of Publication: China 
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Material Identity Number: B714-1999-003 

Language: Chinese Document Type: Journal Paper (JP) 

Treatment: Theoretical (T) 

Abstract: A heterogeneous data integration system can integrate not 
only the data with definite data schemata of the traditional database 
systems but also the semir structured or unstructured data with 
undefined or no data sche^mata of such data sources as WWW. In database 
system, schemata are verY important for the user to construct the 
meaningful queries and for query processor to do proper decomposition and 
optimization. Without storing objects in global site, global schemata in a 
data integration system can not be constructed by scanning entire 
databases as done in other systems. A dynamic dictionary composed of 
templates, which are based on rooted connected directed graphs and able to 
describe not only the structures but also the behaviors of objects, is 
proposed as unified schemata for various heterogeneous data sources. The 
theoretical foundations of dynamic dictionaries along with five template 
operations are presented. With these template operations, global dynamic 
dictionaries for a data integration system can be constructed just by 
operating on dynamic " dictionaries "o'f loc'al data sources,. (7 Refs) 

Subfile: C 

Descriptors: database management systems; distributed databases 

Identifiers: global dynamic dictionary; heterogeneous data integration 
system; database system; query processor; schemata; dynamic dictionary; 
heterogeneous data sources; global dynamic dictionaries 
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Conference Title: Proceedings of International Conference on Information 
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Language: English DocjilTtent Type: Conference Paper (PA) 
Treatment: Theoretical (T) 



# Abstract: We do not aim to create a single multimedia database able to 
store a multimedia document as instance of a primitive data type. Instead, 
we propose that diff erent 7 heterogeneous databases (each handling a 
different data type) cari^ be composed to handle a multimedia document as 
instance of a complex data type. The complex data type is composed starting 
from the the primitive data types. The resulting multimedia database 
requires the cooperation of several databases to handle: complex queries 
spanning over several types of data. Similarly, for complex transactions. 
Thus a uniform query and transaction language is required to handle this 
heterogeneity. This does not mean that optimization techniques on the 
border among the different databases cannot be exploited, Indeed, a novel 
class of join methods have been developed to overcome this drawback. To 
this end, we discuss a new approach based on a logic language that will 
allow to describe structured and unstructured data , to express 

queries' 'modelling certain and •uncertain 'inference . (14 Refs) 
Subfile: C 
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Abstract: To support future digital library systems which draw 
information from different 'sources' on" the Internet, we have* to provide 
integrated queries to pre-existing database servers which contain 
structured, semi structured and unstructured data . We specifically 
examine the problem of querying both existing structured relational 
databases and bibliographic databases. By adopting the well accepted Z39.50 
standard protocol to access bibliographic databases in different legacy 
library systems, we have developed an extended SQL model, known as 
Harp-SQL, to support integrated queries to both SQL databases and 
bibliographic databases. yUsing Harp-SQL, one can not only query 
bibliographic databases ffr an SQL manner, but also perform joins between 
SQL and bibliographic databases. A distributed query processor supporting 
Harp-SQL queries has been developed. We present our query processing 
strategy that is based on a client server interaction model between the 



distributed query processor and the various remote database servers. (20 
Refs) . J 
Subfile: C ^ 
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Abstract: The author presents a software review of askSam Systems 1 askSam 
3.0 Professional which promises to do what ordinary databases cannot. 
Databases are designed to deal with structured data . They require you 
to predefine a structure and shoehorn your information into that 
structure . Long a popular product for desktop information management, 
askSam was designed to hanjlle unstructured data . Unlike traditional 
databases, askSam does *no\ require predefined field types and field 
lengths. However, while you can store formless , unfielded information 
in its database, doing so pretty much limits you to full-text searching. 
With askSam, you can create a database for CD-ROM, diskette, or Web 
distribution with as strict or loose a structure as you wish. (0 Refs) 
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Abstract: The amount of data of all kinds available electronically has 
increased dramatically in recent years. The data resides in different 
forms, ranging from unstructured data in file systems to highly 

structured in relational database systems. Data is accessible through a 
variety of interfaces in.c^u&ing Web browsers, database query languages, 
application-specific inter-faces, or data exchange formats . Some of this 
data is raw data, e.g., images or sound. Some of it has structure even if 
the structure is often implicit, and not as rigid or regular as that 
found in standard database systems. Sometimes the structure exists but has 
to be extracted from the data. Sometimes also it exists but we prefer to 
ignore it for certain purposes such as browsing. We call this data as 
semi- structured data which is (from a particular viewpoint) neither raw 
data nor strictly typed, i.e., not table-oriented as in a relational 
model or sorted-graph as in object databases. The main purpose of the paper 
is to isolate the essential aspects of semistructured data. We also survey 
some proposals of models and ( query, languages for semi- structured <£ata . 
In particular, we consider recent works at Stanford U. and U. Penn on semi- 
structured data . In both cases, the motivation is found in the 
integration of heterogeneous data. (49 Refs) 
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Abstract: Companies are .rightfully skeptical of anything associated with 
the paperless office, a- worn-out phrase used far more often than the 
products it describes. Document management has been a sleepy market 
associated with expensive imaging systems and complicated workflow 
products. But with the influence of several technologies, including the 
Web, document management and imaging is now crossing the threshold into the 
mainstream. In the Web, the document management market has finally found an 
easy, standard way to archive, distribute and display documents. The Web is 
to unstructured data , such as documents and text files, what relational 
databases are to structured data . It also changes the publishing 

hierarchy, making the end user a content publisher. This means- that 



document management systems will have to easily incorporate documents 
created with a variety of front-end tools. (0 Refs) 
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Abstract: Image databases will require a completely new organization due 
to the unstructured and 'perceptual' structure of the data they 
contain. The authors argue that similarity measures, rather than matching 
, will be the organizing principle of image databases. Similarity is a very 
elusive and complex judgment, and typical databases will have to rely on a 
number of different metrics to satisfy the different needs of their users. 
This poses the problem of how to combine different similarity measures in a 
coherent and intuitive way: They propose' "a solution, which is ldosely'based 
on ideas derived from fuzzy logic in that it uses the equivalent in the 
similarity domain of the and, or and not operations. The approach is much 
more general than that, however, and can be adapted to work with any 
operation that combines together similarity judgment. With this approach, a 
query can be described as a directional acyclic graph with certain 
properties. They analyse briefly the properties of this graph, and present 
the interface they are developing to specify these queries. (25 Refs) 
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Abstract: AskSam for Windows 7 version 2.0 is a database program that looks 
like a word processing ' program and is excellent for organizing and 
searching free- form textual information . It offers full text data 
searches. The program is easy for anyone to use who is familiar with a word 
processing program. It is meant for people who are not computer 
professionals but who need to organize and use vast amounts of structured 
and unstructured information more productively. The askSam program 

does not require . a user to know programming, relational database 
concepts, or data structures - (0 Refs) 
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Abstract: The paper discusses indexing in the context of manual and 
conventional text retrieval systems. It argues that adaptive pattern 
recognition technology overcomes many of the problems associated with these 
systems and discusses the benefits to be gained through automatic indexing 
of both structured and 441^5 true tured information . (0 Refs) 
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Abstract: A large percentage of valuable information is not stored in 
database management systems. Therefore, many applications must deal only 
only with heterogeneous ^ structured t _ data , but with a wide variety of 
unstructured and semi- structured data such as electronic mail, 

documents, files and spreadsheets. In addition, these applications must 
deal with heterogeneous services such as an electronic library or an 
airline reservation service. While it is, of course, important to study 
interoperability among heterogeneous conventional database systems, it is 
also very important to study interoperation in this broader domain, where 
application programs must interact with a wide variety of services and must 
handle many types of structures, unstructured, and semi- structured data 
(7 Refs) 
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Abstract: The merging of hypermedia and intelligent tutoring can occur 
through two approaches: integrating intelligent tutoring into hypermedia; 
and integrating hypermedia into intelligent tutoring. The former involves 
enhancing a basic hypermedia system with features that provide more 
structured browsing or tutoring, ranging from the traditional to the 
Al-based approaches. The second approach is the one explored by the author. 
He examines some of the possibilities of integrating hypermedia into 
intelligent tutoring, and some of the problems that are likely to be 
encountered. He uses the GEO geographical tutor and the OPTICLAB optics 
workbench to illustrate some of the views expressed in this paper As 
information resources, intelligent tutoring systems (ITSs) are quite 



different from hypermedia systems (HMSs), principally because of the ITS 
emphasis on representation of knowledge. ITSs are artificial intelligence 
(AI) systems, whereas HMSs are not . The crucial distinction between them 
lies in how they treat information . ITSs structure information into a 
knowledge base, whereas HMSs keep information unstructured and use it 
as data . The structured information of ITSs involve semantic 

relations constituting a knowledge web that can be manipulated and 
interpreted by simple goal-directed rules. The information is thus 
computationable, which gives it the power ascribed in general terms to AI 
technology. (20 Refs) 
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Abstract: Describes the ^distributed approach to fault-tolerant Data 
Management Systems (DMS).^tor use in military C3I systems, supported by 
Selenia MARA computers. The DMS supports concurrent atomic transactions on 
databases composed of information held partly as unstructured or indexed 
sequentially structured files on disk and partly as tables in central 
memory. When used on duplicated hardware configurations, the DMS 
automatically duplicates the database informations thus supporting 
applications requiring continuous availability. Major characteristics of 
the solution are the greatest modularity both at hardware and software 
level and the performance aspects implemented to meet real time 
requirements. (4 Refs) 
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Abstract: There are at least two approaches to describing a data model as 
an algebraic object. One can give a description of the data structure 
(organization) , and then the operations upon the data structures in 
each particular case will be induced by the detailed structures. The dual 
approach consists in axiomatizing the operations upon the data- 
structures as unstructured sets. In each particular case, the data 
structures are derived from the corresponding operation sets . One can 
give two arguments in fa^or of the second approach, which the author has 
used. The first is that an object in the algebraic theory should be 
characterized uniquely by the set of its morphisms and the set of the 
values of the functors for the object from the category theory viewpoint. 
The second is that not all natural operations upon data structures 
can be defined without a knowledge of the semantics. The author constructs 
the algebraic theory of data models. (7 Refs) 
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Abstract: The authors' symbolic shorthand system should be considered 
both as a system language of their Documentation Service and as a 
sensitive, precise medium of interpersonal, communication. It is .a system of 
pre-coordinated indexing in which relationships between nodes are 
registered. It can be adapted to other scientific areas, especially if 
their content lends itself to organization in terms of causes and effects. 
It is less suitable though also usable in purely descriptive scientific 
fields which merely require the registration of unstructured data . (3 
Refs) 
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Talks about biotechnology firm Genentech' s deployment of Venice Bridge, 
middleware from Venetica that will make it easier for Genentech to link its 
image repository system to structured data in Oracle Clinical, an 

application used for managing clinical trials. Mentions that linking the 
image repository and clinical-data repository is valuable because it lets 
Genentech employees who uncover possible errors in the clinical database to 
refer to an image of the^^original record. Reports that Venice Bridge may 
also let Genentech connect the image repository with other pools of 
unstructured content. Explains that middleware means companies do not 
need to merge their systems for unstructured content. Says that IBM Corp. 
has developed the Enterprise Information Portal (EIP) middleware for 
unstructured information . Includes a diagram. (MEM) 

Descriptors: Middleware; Bridge; Information Technology; Information 
Management; Connectivity; Cross-Platform Computing 
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Order out of chaos : trends in search 
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Languages: English 
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Discusses trends in search and information retrieval technologies. 
Explains that the increasing importance of mobile access to corporate data 
and applications will raise new issues for users and new opportunities for 
suppliers. Mentions that going beyond simple keyword and Boolean searches, 
most information retrieval products offer the following advanced tools and 
approaches: elementary linguistic analysis; thesaurus capabilities; pattern 

matching tools; semantic analysis tools; document categorization or 
clustering; personalization; and user profiling. Says that categorization 
tools 'a"re designed to organize* la r*ge ' bodies' of unstructured 'information 

into a form that is easy to navigate. Points out that in theory, 
supporting information retrieval over mobile devices is not a major 
challenge, in light of the fact that the difficult part of retrieval has 
always been the indexing. Includes two photos. (MEM) 
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Infrastructure critical to enterprise portals — Portals can give a 
single access point to corporate data, but not until a framework and 
structure are built to better organize and . . . 

Gates, Lana 

Application Development Trends , August 1, 2000 , v7 n8 p31-36, 5 
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Focuses on enterprise portals (EPs), which provide presentation services 
delivered by a Web browser-based personalized interface, have a single 
point 'of access that provides system 'Security, and allow user's to access 
corporate information and applications. Explains that the enterprise 
information portal must have the ability to personalize the portal, offer 
some level of tracking, and have a platform on which people can 
collaborate, share information, and note who is accessing what type of 
information. Considers the challenges of accessing both structured and 

unstructured data , and of converging technologies into a single product 
and. then providing knowledge services..- Also reports .on Raven, a- beta - 
product from Lotus Developme^nJ: Corp. which has a three-layer architecture: 
the presentation layer or' us^er interface; the discovery and personalization 
layer; and the information access layer. Includes one illustration. 
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Focuses on the need for supporting infrastructure in unified access to 
information. Points out thal^ pimply providing access under a single portal 
does not solve some of^the key problems, such as structure and format. 
Mentions several partial solutions, though points out they are still 
flawed. Suggests that though portals are still not meeting the needs of 
today's customers, they are expected to be the solution of the future. 
Explains the evolution of portals can lead to integrated access to 
heterogenous types of data. Overviews three "'waves 11 of the portal 
revolution: user interface-led integration; separate but equal access to 
structured and unstructured data ; and unified structured and 

unstructured data access. Claims the future should include tracking of 
how the data is used, which will improve collaboration and provide means 
for a smart push of information to users, (kgh) 
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Beyond the hype, wha;£^ will an EIP do for you? With the right 
solution, you can tap the benefits of centralized information, application 
access 

Borck, James R 
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Presents a guide to enterprise information portals (EIPs) . Says that 
EIPs streamline access to data and mission-critical applications. Explains 
that EIPs enable improved decision-making capabilities and integrate easily 
into existing technologies. Mentions the drawbacks: lack of portal 
standards, costliness, ajid. EI EJ , solutions. 1 .. la,ck of administration functions. 
Lists EIP characteristics: provisions for taxonomy and application 
integration, searching and indexing of structured and unstructured 

data across repositories, and access to enterprise resource planning 
(ERP) and legacy mainframe data. Shows a table comparing five EIP solutions 
from five providers on ease of installation, data source/application 
availability, security and authenticity, scalability, personalization and 
customization, administration, Web-based indexing, and content subscription 
plug-ins. Includes a screen display, eight benchmark test . results , a table, 
and a sidebar. (MEM) 4 
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Plumtree Software; DataChannel; Hummingbird Communications; iPlanet 
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DIALOG (R) File 233: Internet & Personal Comp . Abs . 
. (c). 20Q3. EBSCO.Pub. All xts. r,Q£er.v. . ... 

00399160 95PW10-016 
ask Sam 

Miastkowski, Stan 

PC World , October 1, 1995 , vl3 nlO p76, 1 Page(s) 

ISSN: 0737-8939 

Company Name: askSam Systems 

Product Name: askSam for Windows 

Languages: English J 

Document Type: Software' Review 

Grade (of Product Reviewed) : B 

Hardware/Software Compatibility: IBM PC Compatible; Microsoft Windows 
Geographic Location: United States 

Presents a favorable review of askSam 3.0 for Windows ($149.95), an 
database manager from askSam Systems (800) . The program works well in 
managing unstructured information . Merely load file containing 

information of interest and the program allows searching by phrase or 
using Boolean operators to find the precise information desired. This 
release can import HTML documents and it also supports NN fuzzy 11 searches. 
■For structured. -data ,tfre pxogram allows the creation of fields and 
forms as competing programs do. Includes one screen display, (djd) 

Descriptors: Data Base Management; Software Review; Window Software 

Identifiers: askSam for Windows; askSam Systems 
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DIALOG (R) File 233: Internet & Personal Comp. Abs. 
(c) 2003 EBSCO Pub. All rts. reserv. 

00354844 94WN07-036 

Do it yourself with point-and-click programming — If off-the-shelf 
applications don f t quite fill the bill, why not develop a custom app that 
suits you to a T? See how we designed. . . 

Moran, Enis 

Windows Magazine , July 1, 1994 , v5 n7 p204-214, 8 Page(s) 
ISSN: 1060-1066 
Company Name: Microsoft 

Product Name: Visual Basic Professional Edition 
Languages: English 

Document Type; Featur.e ArticJ.es. .and N.ews . , . 



Geographic Location: United States 

Shows how to create a custom programming application, that is, an 
address book that can be shared over a local area network, using tools from 
Microsoft's Visual Basic 3.0 Professional Edition. Attention is given to 
the cardfile-type user interface; importing and merging existing cardfile 
data; .the interface's 3-D panel and card captions; searching the database 
using a for... next loop; and snaring on the network while keeping all users 
in sync. Discusses some special problems in importing existing cardfiles, 
which result from having to insert unstructured data into a structured 
database. Also considers simplifying the cut and paste operation; adding 
an autodial feature; and composing a help file and setup program. Sidebars 
briefly describe Visual Basic along with its Windows version and its 
Applications Edition; and show how custom controls can save time and 
effort. Includes four screen displays and two program listings, (jo) 

Descriptors : Programming. V design; Basic;.- User Interface;. -Programming 
Aids; Programming Instruction ; Contact Manager; Tutorial 

Identifiers: Visual Basic Professional Edition; Microsoft 
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DIALOG (R) File 233: Internet & Personal Comp. Abs . 
(c) 2003 EBSCO Pub. All rts . reserv. 

00121393 86PK05-130 

Text-based systems manage data in a number of ways 

Sullivan, Kristina B ^ s t w m t _ 

PC Week , May'l3 1986 ,*v3*nl9 pl33, T Pages 

ISSN: 0740-1604 

Languages: English 

Document Type: Article 

Geographic Location: United States 

Examines some of the approaches offered by text-based database systems 
to organize and retrieve data. Says that choices include programs that do 
or do not index data, "and those that provide for structured or 
unstructured data " . 7 

Descriptors: DATA BASE MANAGEMENT; TEXT 
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DIALOG (R) File 94 : JICST-EPlus 

(c)2004 Japan Science and Tech Corp(JST). All rts. reserv. 

03485026 JICST ACCESSION NUMBER: 98A0246454 FILE SEGMENT: JICST-E 
Relation-based Management of Semi- Structured Data . 
TOOYAMA MOTOMICHI (1) 

(1) Kei.o. Univ. , Fac.of Sci. and Technol.^ 

Jotio Shori Gakkai Kenkyu* Hokoku* 1998, VOL . 98 , NO . 2 (DBS- 114 ) , PAGE. 105-112, 

FIG. 5, REF.ll 
JOURNAL NUMBER: Z0031BAO ISSN NO: 0919-6072 
UNIVERSAL DECIMAL CLASSIFICATION: 681.3:061.68 
LANGUAGE: Japanese COUNTRY OF PUBLICATION: Japan 

DOCUMENT TYPE: Journal 
ARTICLE TYPE: Original paper 
MEDIA TYPE: Printed Publication 

ABSTRACT: In this paper, we^^-il discuss the possibilities and the 

limitations of relation based management of semi- structured data . 
Semi- structured data are explained as something in the midway from 
unstructured data to highly structured data . On discussing 
management of semi- structured data , we should exploit the 
regularity that are not expected in unstructured data , while 
allowing irregularities forbidden in highly structured data - From 
our past experiences on providing HTML or other kinds of applicational 
data as a dynamic heterogeneous view of a relational database, we have 
recognized that some sorts of irregularities could be handled and 
realized nicely in this architecture, leaving relational database 
itself as a rigid dat:a % repository. Especially, the dynamic invocation 
of a query, along with the lazy evaluation nature of Web-like 



interaction, is presented as an elegant solution for navigational 

accesses in recursive data references, (author abst.) 
DESCRIPTORS: relational data base; recall precision; data structure ; 

performance evaluation; flexibility; dynamic system; query processing; 

information system; lazy evaluation 
BROADER -DESCRIPTORS :■ database* *ef f i-cienoy-; structure; evaluation; property 

system; information processing; treatment; computer application system 
CLASSIFICATION CODE(S): JD03030U 





S&t Items Description 

51 1615354 INDEX? OR INDICES OR LIST? ? OR REGISTRY OR REGISTRIES OR 

DIRECTORY? 

52 9887096 COMBINE? OR LINK? OR ASSOCIAT? OR RELAT? OR JOIN? OR CONNE 

CT? 

53 20532 (STRUCTURE? OR FORM? OR CONFIGURATION? OR SYNTHESIS? OR AR 

RANGEMENT?) (2N) SI 

54 22364 DATA (2N) STRUCTURE? OR SEQUENCE ( 2N) CHARACTER? 

55 414 8680 EQUAL? OR CORRESPOND? OR MATCH? OR PARALLEL? OR SAME OR (G 

REATER OR LARGER OR BIGGER OR LESS) (2N) THAN 

56 134 UNSTRUCTURED (2N) SI 

57 6821832 BOOLEAN () (OPERATOR? OR VALUE?) AND "OR" OR LOGICAL () INCLUS 

ION OR XOR OR EXCLUSIVE "OR" OR "NOT" OR LOGICAL () NEGATION 

58 151982 (STRUCTURE? OR FORM? OR CONFIGURATION? OR SYNTHESIS? OR AR 

RANGEMENT?) (2N) (DATA OR INFORMATION) 

59 5503 UNSTRUCTURED (2N) (DATA OR INFORMATION) 

510 48356 SI (2N) S2 

511 22364 S4 OR SSSS5 

512 169318 S3 OR S8 

513 21904 Sll (S) S12 

514 5542 S6 OR S9 

515 635 S14 (S) S7 

516 167 S13 (S) S15 

517 46 S3 (S) S12 ' (S) S14 

518 6319 S2 (S) S13 

519 262 S10 (S) S13 

520 2 S19 (S) S14 

521 0 S10 (S) S3 (S) S6 

522 2 S10 AND S3 AND S6 

523 49 S17 OR S20 OR S22 

524 41 S23 NOT PY>2001 

525 35 S24 NOT PD>20010813 

526 30 RD (unique items) 
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DIALOG (R) File 647: CMP Computer Fulltext 
(c) 2004 CMP Media, LLC. All rts . reserv. 

01104094 CMP ACCESSION NUMBER: CWK19960923S0035 
Who's More Meta? - Users weigh which vendors can provide best 
me tadi rectory 

Sharon Fisher and Jeffrey Schwartz 
COMMUNICATIONSWEEK, 1996, n 630, PG19 
PUBLICATION DATE: 960923 

JOURNAL CODE: CWK LANGUAGE: English 

RECORD TYPE: Fulltext y 

SECTION HEADING: LANs & Servers - Network Operating Systems, Adapters, 
Wiring, Backup 

WORD COUNT:- 62 6 . - . - .... - . 

... 500 directory as a separate metadirectory product. 

The announcement will include Locator, a product that lets users " 
directory -enable" applications linked to the CDS X.500 directory. The 
tool will let users search for both structured and unstructured data 
, according to James Payne, marketing manager for E-mail integration. By 
making its directory available combined with Locator, users will be 
able to search any attribute in the directory, Payne said. Such 
attributes 1 



26/3, K/2 (Item 1 from file: 275) 

DIALOG (R) File 275: Gale Group Computer DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 

02414354 SUPPLIER NUMBER: . 63330202 (USE FORMAT 7 OR 9 FOR FULL TEXT) * 

Beyond the hype, what will an EIP do for you? With the right solution, you 

can tap the benefits of centralized information, application 

access. (Software Review) (Evaluation) 

Borck, James R. 
InfoWorld, 22, 28, 49 
July 10, 2000 

DOCUMENT TYPE: Evaluation ISSN: 0199-6649 LANGUAGE: English 

RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 217 6 LINE COUNT: 0018 9 

... right EIP for your business. 

At the very least, EIPs should offer features such as searching and 
indexing structured and' unstructured data across repositories , * both 
internal and external to a company. But to realize their full potential, 
EI Ps must . . . 



26/3, K/3 (Item 2 from file: 275) 

DIALOG (R) File 275: Gale Group Computer DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 

02099468 SUPPLIER NUMBER: 19681745 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Metadata: the missing link, (decision-support utilities) (Technology 
Information) 

Sherman, Richard P. 
DBMS, vlO, n9, p73(6) 
August, 1997 

ISSN: 1041-5173 LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 44 61 LINE COUNT: 00371 

are deliveryAGENT, metaWAREHOUSE, and deliveryADMIN . 
The deliveryAGENT is the Web browser or Windows user interface to the 
information directory . ' Both structured ' and unstructured * data^ can 
be cataloged and delivered. Information is arranged as information objects 
called collections. Business users search for... 
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DIALOG ( R) File 275:Gale Group Computer. DB-t-TM). 
(c) 2004 The Gale Group. All rts . reserv. 

02032708 SUPPLIER NUMBER: 19030804 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Textbases deliver Web results . (Review of products offering text storage 

indexing and retrieving capabilities) (special supplement: Internet 

Systems) (Software Review) (Evaluation) 

Spitzer, Tom 

DBMS, vlO, nl, pS13(5) 

Jan, 1997 

DOCUMENT TYPE: Evaluation " ISSN: 1041-5173 LANGUAGE: English 

RECORD TYPE: Fulltext; Abstract 

WORD COUNT-: . 4678. . LINE. COUNT: . 00387. .„ . . . ... . . _ 

. . . indexes that point to the documents in their native locations and 

formats. Document management systems embed both structured and 
unstructured indexes and surround them with a groupware-style 
environment in which documents can maintain lives of their own. . . 

...to generate a list of hits that were only about high energy physics. The 
third icon — "Show linkage- " — « lists all- of the documents linked to the 
document that had been highlighted. Finally, if the browser selects... 
hooked to each function offered by the engine. The Vocabulary button 
requests the engine to display a form that lists nearby words in the 
index, in alphabetical order. The Thesaurus button requests the engine to 
display a form that lists synonyms for the search term. The Concepts 
button requests the engine to display a list of the... engine is a powerful 
option, too. Finally, PC DOCS' DOCS Open represents document management 
systems that package structured and text indexing search strategies 
within an active document repository system. 

One differentiator among these products, both within and across... 
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DIALOG (R) File 275: Gale Group Computer DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 

02005229 SUPPLIER NUMBER: 18857624. (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Microsoft sews up its Nets; aims for seamless Inter-/intranet 
interaction. (Product Announcement) 

Leach, Norvin ....... ^ - . , . . . 

PC Week, vl3,.n45, p27(2) 
Nov 11, 1996 

DOCUMENT TYPE: Product Announcement ISSN: 0740-1604 LANGUAGE: 

English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 595 LINE COUNT: 00051 

... Falcon, a message queuing system; Denali, a scripting engine; OLE 

DB, a filter that lets users query structured or unstructured data ; 
and Active Directory, a new v 'directory structure that was formerly 
called OLE DS . 

Pieces of the Active Platform, such as a beta of Viper. . . 
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DIALOG (R) File 275: Gale Group Computer DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 

01944265 SUPPLIER NUMBER: 18315429 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Multimedia and document DBMSs and tools. (1996 Database Buyer's Guide and 
Client/Server Sourcebook) (Buyers Guide)* 

DBMS, v9, n6, p72 (3) 
June 15, 1996 

DOCUMENT TYPE: Buyers Guide ISSN: 1041-5173 LANGUAGE: English 

RECORD TYPE: Fulltext; Abstract 



WORD COUNT: 3799 



LINE COUNT: 00335 



... -and lets multiple users, search, -retrieve, and analyze documents 

stored in almost any language. Features a powerful indexing structure 
that lets users pinpoint information by entering a word, concept, phrase, 
or combination of phrases, in any. . . 
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DIALOG (R) File 275: Gale Group Computer DB(TM) 
(c) 2004 The Gale Group. All rts . reserv. 

01804825 SUPPLIER NUMBER: 17155736 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Multimedia and document DBMSs and tools. (1995 Database Buyers Guide and 
client/ server sourcebook) (Buyers Guide) 

DBMS, v8, n6, p68(3) 
May 15, 1995 

DOCUMENT TYPE: Buyers Guide ISSN: 1041-5173 LANGUAGE: English 

RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 354 6 LINE COUNT: 00308 

stored documents by entering a word, concept, phrase, or 
combination of. phrases, In- any -.length . Offers an indexing structure / 
with users able to pinpoint critical information in seconds, even when 
searching through millions of documents in. . . 
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DIALOG (R) File 275: Gale Group Computer DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 

01689852 SUPPLIER NUMBER^ 15399636 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Delivery and retrieval technology. (Seybold Special Report: Seybold 
Seminars Boston f 94, Part II) 

Seybold Report on Publishing Systems, v23, nl6, pS20(16) 
May 10, 1994 

ISSN: 0736-7260 LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT 

WORD COUNT: 16097 LINE COUNT: 01272 

... as filing (check-in/check-out ) , security (access privileges) and 

control over revisions. DynaBase provides a hierarchical directory 
structure that is supplemented by attributes (metadata) for elements at 
any. level of the . hierarchy . A «Vwhere-used. . ..differences by applying a- 
different style sheet to the changed text. 

Another interesting application — again one that links the DynaText 
index to the DynaBase index — is multilingual publishing. Changes to a 
document can be highlighted in a DynaText ... was in getting Verity ! s engine 
to recognize Frame 1 s binary format. (The regular viewer reads mif format 
.) Indexes are built for a set of documents, not incrementally, which 
means a collection must be re-indexed. . .market leader. 

Jouve services. Jouve 1 s client-server GTI (graphics, text, images) 
tool set for electronic delsiyery indexes unstructured text or 
sgml-structured text and handles a variety of graphic formats. Its viewing 
tools offer a. . . 

...saved in bmp, tiff and jpeg formats. 

Rounding out the build kit are tools for validating the index 
configuration and a module for creating tapes for cd-rom mastering in ISO 
9660 format. The build tools... 

. . .expects to have that in the field this month. 

There are three levels to reading sgml — an indexer , a formatter 
and a viewer. All are supplied. in a- software developer's toolkit written in 

C + +. 

Client tools . . . 
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Direct Mail 1.0; MyMailList 1.1.2. (Software Review) (Preferred Publishers 
Inc's Direct Mail; MySoftware Go's MyMailList 1.1.2) (evaluation) 

Stevens, Lawrence 
Macworld, v7, n3, p205(2) 
March, 1990 yj 

DOCUMENT TYPE: evaluation „^ ISSN: 0741-8647 LANGUAGE: ENGLISH 

RECORD TYPE: ABSTRACT 

...ABSTRACT: print capabilities. Documentation for either program is poor. 
Direct Mail can save configuration work when- importing large unstructured 

lists , but MyMailList is a more powerful program for maintaining 
structured lists . 
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No-cost solutions : you may be able to use your current software to get 

organized. (Special Report - Getting Organized) (includes related article 

on text-based databases and note- taking programs) 

Davidson, Marc 

Lotus, v5, n6, p78(4) 

June, 1989 v j 

ISSN: 8756-7334 LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT; ABSTRACT 

WORD COUNT: 2798 LINE COUNT: 00209 

. . .ABSTRACT: personal information management program. Lotus Development 
Corp's Lotus 1-2-3 is excellent for managing highly structured 
information that requires only a few fields and is displayed in a list 
format , such as a Rolodex, a phone log, a daily calendar or a to-do list. 
Traditional databases. . . 

...Tate's dBase III PLus or Software Publishing Corp's Professional File 
are good for more extensive structured information such as a contact 
file or project tracking*. Highly unstructured information can be 
managed by an outliner. But systems are effective only if they are used. 
The fewer. . . 
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01298273 SUPPLIER NUMB£>: 07376284 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Products, (text and image -retrieval software packages) 

Frentzen, Jeffrey 

PC Week, v6, n24, p95(2) 

June 19, 1989 

ISSN: 0740-1604 LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT 

WORD COUNT: 1921 LINE COUNT: 00153 

... Reference Technology Inc. Full Text Manager is a developers' 

package for text and image retrieval . Features include structured and 
unstructured indexing / a built-in thesaurus and document sizes as large 
as 16M bytes. 'Data ' can* be formatted 'according to the High Sierra and 
the International Standards Organization's technical specifications for CD 
ROM disks . . . 
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04835943 Supplier Number: 66677531 (USE FORMAT 7 FOR FULLTEXT) 
Dataware Tech. Changes Its Name, Refocuses Its Business Strategy. 

Electronic Information Report, v21, n40, pNA 
Nov 3, 2000 

Language: English Record Type: Fulltext 
Document Type: Newsletter; Trade 
Word Count: 456 

software provider to an e-commerce business solutions provider. 
LeadingSide ' s services now focus on organizing and indexing the 
structured and unstructured ' " information ' archived by its client 
companies, combining that information with relevant external content and 
making it available via... 
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04754195 Supplier Number: 64200939 (USE FORMAT 7 FOR FULLTEXT) 
BancTec launches eFIRST archive+; New solution addresses the need for 
intelligent information management and access . 

M2 Presswire, pNA 
August 15, 2000 

Language: English Record Type: Fulltext 
Document Type: Newswire; Trade 
Word Count: 737 

providing browser access from anywhere within the enterprise, or 
beyond. Another significant feature is the support of " unstructured 
indexing " of many types of ob'jects* (including scanned images) alongside 
traditional database-type structured indexes . This indexing can be 

enabled 
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DATAWARE TECHNOLOGIES: 6 million documents stored in a streamlined document 
management system 

M2 Presswire, pN/A 
Jan 19, 1998 

Language: English Record Type: Fulltext 
Document Type: Newswire; Trade 
Word Count: 953 

files simply by entering a word, concept, phrase or combination of 
phrases. By offering the most powerful indexing structure available 
today, Dataware BRS/Search" lets* users pinpoint the information they need in 
seconds, even across millions... 
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DATAWARE: Dataware releases^latest version of world's most widely used text 
database system 

M2 Presswire, pN/A 
August 6, 1997 

Language: English Record Type: Fulltext 



Document Type: Newswire; Trac^e^ 
Word Count: 662 



taking advantage of Internet and LAN versions of the product. 
Dataware BRS/Search manages large collections of unstructured 
information , allowing multiple users to quickly and efficiently search, 
retrieve and analyze text, image, graphics, audio and video files simply by 
entering a word, phrase or combination of phrases. By offering the most 
powerful indexing structure available today, Dataware BRS/Search lets 
users pinpoint the information they need in seconds, even across millions 
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. . . aggregate 

on-line original, unabridged, unstructured content (catalogs, 
specifications , 

manuals, installation notes, etc.) in paper on electronic format / index 

this 

unstructured data and allow buyers to search quickly for any item on 
any page' ' * 

in any catalog using natural... 

... WIZNET ! s technology and skilled personnel aggregate on-line original, 
unabridged, unstructured content in paper or electronic format , index 

this 

unstructured data and allow buyers to search for and instantly access 
any item 

on any page in any catalog., j 
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PR Newswire 
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... WIZNET 1 s technology and skilled personnel aggregate on-line 

original, unabridged, unstructured content in paper or electronic format , 

index this unstructured data and allow buyers to search for and 
instantly ^ j 

access any item on any page^in any catalog. . . 
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ANXeBusiness Teams with WIZNET to Add Content Management Solution to ANX 
Services . 

Business Wire, p2093 
Feb 5, 2001 

Language: English Record Type: Fulltext 
Document Type: Newswire; Trade 
Word Count: 864 

... WIZNET 's technology and skilled personnel aggregate on-line 

. original, unabridged, unstructured .content in paper or electronic . format 
index this unstructured data and allow buyers to search for and 
instantly access any item on any page in any catalog... 
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Portum and WIZNET Team to Enhance Auctions with Supplier Content 
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Business Wire, p0087 
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Language: English Record Type: Fulltext 
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Word Count: 585 

... WIZNET ! s technology and skilled personnel aggregate on-line 

original, unabridged, unstructured content in paper or electronic format 

index -this unstructured. data . -and allow buyers to search for and, 
instantly access any item on any page in any catalog... 
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CMD Group's First Source Exchange Features More Than 1500 Building Product 
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Business Wire, p2325 
Jan 18, 2001 

Language: English Record Type: Fulltext 
Document Type: Newswire; Trade 
Word Count: 723 

... aggregate on-line' original, unabridged, unstructured content 

(catalogs, specifications, manuals, installation notes, etc.) in paper on 
electronic format , index this unstructured data and allow buyers t 
search -quickly, for any Ltem on ..any. page in any catalog using natural... 
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. . . Falcon, a message queuing system; Denali, a scripting engine; OLE 

DB, a filter that lets users query structured or unstructured data ; 
and Active Directory, a new directory structure that was formerly 
called OLE DS . 

Pieces of the Active Platform, such as a beta of Viper... 
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500 directory as a separate metadirectory product. 
The announcement will include Locator, a product that lets users 11 
directory -enable" applications linked to the CDS X.500 directory. The 
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according to James Payne, marketing manager for E-mail integration. By 
making its directory available combined with Locator, users will be 
able to search any attribute in the directory, Payne said. Such attributes 



